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Abstract. We consider the regular model of formula generation in conjunctive 
normal form (CNF) introduced by Boufkhad et. al. in [6]. In [6], it was shown that 
the threshold for regular random 2-SAT is equal to unity. Also, upper and lower 
bound on the threshold for regular random 3-SAT were derived. Using the first 
moment method, we derive an upper bound on the threshold for regular random 
k-SAT for any k > 3 and show that for large k the threshold is upper bounded by 
2*ln(2). We also derive upper bounds on the threshold for Not-All-Equal (NAE) 
satisfiability for k > 3 and show that for large k, the NAE-satisfiability threshold 
is upper bounded by 2 i_1 ln(2). For both satisfiability and NAE-satisfiability, the 
obtained upper bound matches with the corresponding bound for the uniform 
model of formula generation [1,9]. 

For the uniform model, in a series of break through papers Achlioptas, Moore, 
and Peres showed that a careful application of the second moment method yields 
a significantly better lower bound on threshold as compared to any rigorously 
proven algorithmic bound [1,3]. The second moment method shows the existence 
of a satisfying assignment with uniform positive probability (w.u.p.p.). Thanks to 
the result of Friedgut for uniform model [10], existence of a satisfying assign- 
ment w.u.p.p. translates to existence of a satisfying assignment with high prob- 
ability (w.h.p.). Thus, the second moment method gives a lower bound on the 
threshold. As there is no known Friedgut type result for regular random model, 
we assume that for regular random model existence of a satisfying assignments 
w.u.p.p. translates to existence of a satisfying assignments w.h.p. We derive the 
second moment of the number of satisfying assignments for regular random k- 
SAT for k > 3. There are two aspects in deriving the lower bound using the sec- 
ond moment method. The first aspect is given any k, numerically evaluate the 
lower bound on the threshold. The second aspect is to derive the lower bound as 
a function of k for large enough k. We address the first aspect and evaluate the 
lower bound on threshold. The numerical evaluation suggests that as k increases 
the obtained lower bound on the satisfiability threshold of a regular random for- 
mula converges to the lower bound obtained for the uniform model. Similarly, 
we obtain lower bounds on the NAE-satisfiability threshold of the regular ran- 
dom formulas and observe that the obtained lower bound seems to converge to 
the corresponding lower bound for the uniform model as k increases. 
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1 Regular Formulas and Motivation 



A clause is a disjunction (OR) of k variables. A formula is a conjunction (AND) of a 
finite set of clauses. A fc-SAT formula is a formula where each clause is a disjunction of 
k literals. A legal clause is one in which there are no repeated or complementary literals. 
Using the terminology of [6], we say that a formula is simple if it consists of only legal 
clauses. A configuration formula is not necessarily legal. A satisfying (SAT) assignment 
of a formula is a truth assignment of variables for which the formula evaluates to true. 
A Not-All-Equal (NAE) satisfying assignment is a truth assignment such that every 
clause is connected to at least one true literal and at least one false literal. We denote 
the number of variables by n, the number of clauses by m, and the clause density, i.e. 
the ratio of clauses to variables, by a — — . We denote the binary entropy function by 
h(-), h{x) = — xln(x) — (1 — x)ln(l —x), where the logarithm is the natural logarithm. 

The popular, uniform k-SAT model generates a formula by selecting uniformly and 
independently m-clauses from the set of all 2*(?) fe-clauses. In this model, the literal 
degree can vary. We are interested in the model where the literal degree is almost con- 
stant, which was introduced in [6]. Suppose each literal has degree r. Then 2nr — km, 
which gives a = 2r/k. Hence a can only take values from a discrete set of possible val- 
ues. To circumvent this, we allow each literal to take two possible values for a degree. 
For a given a, let r = ^ and r = [rj . Each literal has degree either r or r + 1 . Also 
a literal and its negation have the same degree. Thus, we can speak of the degree of a 
variable which is the same as the degree of its literals. Let the number of variables with 
degree d be n<i, d € {r, r + 1 }. Let X\ , . . . 7 X„ r be the variables which have degree r and 
X„ T+ \ , . . . ,X„ be the variables with degree r + 1 . Then, 
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n T = n + rn- 

and n r +n r +i = n. As we are interested in the asymptotic setting, we will ignore the 
floor in the sequel. We denote the fraction of variables with degree r (resp. r + 1) by 
A r (resp. A T+ i) which is given by 

a -> ka , ka 

A T = l+r-—, A I+l = — -r. (1) 

When A r or A T+ \ is zero, we refer to such formulas as strictly regular random formulas. 
This implies that there is no variation in literal degree. If A r , A r+ i > 0, then we say that 
the formulas are 2-regular random formulas. 

A formula is represented by a bipartite graph. The left vertices represent the literals 
and right vertices represent the clauses. A literal is connected to a clause if it appears in 
the clause. There are kan edges coming out from all the literals and kan edges coming 
out from the clauses. We assign the labels from the set £ = {1, . . . ,kan} to edges on 
both sides of the bipartite graph. In order to generate a formula, we generate a random 
permutation 77 on £ . Now we connect an edge i on the literal node side to an edge 
n(i) on the clause node side. This gives rise to a regular random k-SAT formula. Note 
that not all the formulas generated by this procedure are simple. However, it was shown 
in [6] that the threshold is the same for this collection of formulas and the collection 



of simple formulas. Thus, we can work with the collection of configuration formulas 
generated by this procedure. 

The regular random k-S AT formulas are of interest because such instances are com- 
putationally harder than the uniform fc-SAT instances. This was experimentally ob- 
served in [6] , where the authors also derived upper and lower bounds for regular random 
3-SAT. The upper bound was derived using the first moment method. The lower bound 
was derived by analyzing a greedy algorithm proposed in [ 1 3] . To the best of our knowl- 
edge, there are no known upper and lower bounds on the thresholds for regular random 
formulas fork > 3. 

Using the first moment method, we compute an upper bound a* u on the satisfiability 
threshold a* for regular random formulas for k > 3. We show that a* < 2 k ln(2), which 
coincides with the upper bound for the uniform model. We also apply the first moment 
method to obtain an upper bound a* NAE on the NAE-satisfiability threshold a* AE of 
regular random formulas. We show that a* AE < 2 i ~ 1 ln(2) which coincides with the 
corresponding bound for the uniform model. 

In order to derive a lower bound a* t on the threshold, we apply the second moment 
method to the number of satisfying assignments. The second moment method shows 
the existence of a satisfying assignment with uniform positive probability (w.u.p.p.). 
Due to the result of Friedgut for uniform model [10], existence of a satisfying assign- 
ment w.u.p.p. translates to existence of a satisfying assignment with high probability 
(w.h.p.). Thus, the second moment method gives lower bound on the threshold for uni- 
form model. As there is no known Friedgut type result for regular random model, we as- 
sume that for regular random model existence of a satisfying assignments w.u.p.p. trans- 
lates to existence of a satisfying assignments w.h.p. This permits us to say that second 
moment method gives valid lower bound on the threshold. We compute the second mo- 
ment of the number of satisfying assignments for regular random model. Similar to the 
case of the uniform model, we show that for the second moment method to succeed the 
term corresponding to overlap n/2 should dominate other overlap terms. We observe 
that the obtained lower bound a] converges to the corresponding lower bound of the 
uniform model, which is 2*ln(2) — (k+ 1)^^ - 1 as k increases. Similarly, by com- 
puting the second moment of the number of NAE-satisfying assignments we obtain that 
a/,NAE converges to the corresponding bound 2 A: ~ 1 ln(2) — 0(1) for the uniform model. 
The lower bounds are not obtained explicitly as computing the second moment requires 
finding all the positive solutions of a system of polynomial equations. For small values 
of k, this can be done exactly. However, for large values of k we resort to a numerical 
approach. Our main contribution is that we obtain almost matching lower and upper 
bounds on the satisfiability (resp. NAE-satisfiability) threshold for the regular random 
formulas. Thus, we answer in affirmative the following question posed in [1]: Does the 
second moment method perform well for problems that are symmetric "on average"? 
For example, does it perform well for regular random k-SAT where every literal appears 
an equal number of times? . 

In the next section, we obtain an upper bound on the satisfiability threshold and 
NAE satisfiability threshold. 



2 Upper Bound on Threshold via First Moment 



Let X be a non-negative integer- valued random variable and E(X) be its expectation. 
Then the first moment method gives: P(X > 0) < E(X). Note that by choosing X to 
be the number of solutions of a random formula, we can obtain an upper bound on the 
threshold a* beyond which no solution exists with probability one. This upper bound 
corresponds to the largest value of a at which the average number of solutions goes to 
zero as n tends to infinity. In the following lemma, we derive the first moment of the 
number of SAT solutions of the regular random A:-SAT for k > 3. 

Lemma 1. Let N(n,a) (resp. N NAE (n,a)) be the number of satisfying (resp. NAE satis- 
fying) assignments for a randomly generated regular k-SAT formula. Then 4 , 

E W"'«»= 2 "W coef ((^)°"^^)- (2) 

where 

p(x) = (l+x) k -l, p mE {x) = {\+x) k -\-x k , (4) 

andcoef (^p(x) an ,x^^j denotes the coefficient of it in the expansion of p(x) an . 

Proof. Due to symmetry of the formula generation, any assignment of variables has 
the same probability of being a solution. This implies 

E{N{n,a)) = 2"P (X = {0, . . . ,0} is a solution) . 

The probability of the all-zero vector being a solution is given by 

P (X = {0, . . . , 0} is a solution) = 

Number of formulas for which X = {0, . . . , 0} is a solution 
Total number of formulas 

The total number of formulas is given by (ton)!. The total number of formulas for 
which the all-zero assignment is a solution is given by 

((^)!) 2 coef(,(xr,^). 

The factorial terms correspond to permuting the edges among true and false literals. 
Note that there are equal numbers of true and false literals. The generating function 
p(x) corresponds to placing at least one positive literal in a clause. With these results 
and observing that 




4 We assume that lean is an even integer. 



we obtain (2). The derivation for E(N NAE (n,a)) is identical except that the generating 
function for clauses is given by p NAE (x). □ 

We now state the Hayman method to approximate the coef-term which is asymptot- 
ically correct [11]. 

Lemma 2 (Hayman Method). Let q(y) = Y,i<liy' be a polynomial with non-negative 
coefficients such that qoj^O and q\ ^ 0. Define 



dq(y) 1 , , N dajy) 

Then, 



coef {q(yy,yn = — - -%H=_(1 + (l)), (6) 

where y w is the unique positive solution of the saddle point equation a q {y) = uj. 

We now use Lemma 2 to compute the expectation of the total number of solutions. 

Lemma 3. Let N(n,a) (resp. N NAE (n,a)) denote the total number of satisfying (resp. 
NAE satisfying) assignments of a regular random k-SAT formula. Let q(x) = 
Qnae{x) = ^j^, where p(x) and p NAE (x) is defined in (4). Then, 



E(N(n,a)) = /_^ e »(M2)-^ln(2)+aln(^))-(^- Q )lnfe)) (1 +o(1))j (?) 

E(N mE (n,a)) = ^ (l+o(l)), (8) 

V ™qNAE\ X k,NAE) 

where Xk (resp. x^^) is the positive solution of a q (x) = | — 1 (resp. a qmE (x) = | — I). 
The quantity a q (x), a qmE (x), b q (x), and b qmE (x) are defined according to (5). 

In the following lemma we derive explicit upper bounds on the satisfiability and NAE 
satisfiability thresholds for k > 3. 

Lemma 4 (Upper bound). Let a* (resp. a* AE ) be the satisfiability (resp. NAE satisfia- 
bility) threshold for the regular random k-SAT formulas. Define a* (resp. a* NAE ) to be 
the upper bound on a* (resp. ot* NAE ) obtained by the first moment method. Then, 

a* < a* u < 2*ln(2)(l + <*(!)), a* NAE < < M£ = 2*" 1 ln(2) - ^ -o k (\). (9) 



Proof. We observe that the solution x^ of the saddle point equation a q (x) = | — 1 
satisfies: x^ = argmin^Q-^p-, where a q (x) is defined according to (5). This implies that 
we obtain the following upper bound on the growth rate of E(N(n,a))) for any x > 0, 

lim ME(N(n,a))) ^ _ fealn(2 ) + aln(g(jc)) - ( ^ - aV(jc). (10) 



We substitute x = 1 — in (10). Then we use the series expansion of ln(l — x), l/i> 
1 /2\ and — 1/z > —1 to obtain the following upper bound on the threshold, 

»<- 2*ln(2) 

The summation of the last three terms in the denominator of (1 1) is positive. This can be 
easily seen for k > 8. For 3 < k < 8, it can be verified by explicit calculation. Dropping 
this summation in (1 1), we obtain the desired upper bound on the threshold. To derive 
the bound for NAE satisfiability, we note that X£, NAE = 1 for k > 3. By substituting this 
in the exponent of E(N NAE ) and equating it to zero, we obtain the desired expression for 
a*,. „.-. □ 

W,NAE 

In the next section we use the second moment method to obtain lower bounds on 
the satisfiability and NAE satisfiability thresholds of regular random k-S AT. 



3 Second Moment 

A lower bound on the threshold can be obtained by the second moment method. The 
second moment method is governed by the following equation 

P(*>0)>|g. (12) 

In this section we compute the second moment of N(n,a) and N KAE (n,a). Our computa- 
tion of the second moment is inspired by the computation of the second moment for the 
weight and stopping set distributions of regular LDPC codes in [14, 15] (see also [4]). 
We compute the second moment in the next lemma. 

Lemma 5. LetN{n,a) be the number of satisfying solutions to a regular random k-SAT 
formula. Define the function /(xi,X2,*3) by 

f(xi,x 2 ,x 3 ) = (1 +xi +x 2 +x 3 ) k - (1 +xi) k - (1 +x 3 ) k + 1. (13) 

If the regular random formulas are strictly regular, then 

E(N(n,a) 2 ) = 

n / \ ((r(n - /))!) 2 ((ri)!) 2 coef (/(xi.xa.xs) ",^"" ^'*?"" ) 
E 2 " • 7T^\ • (14) 

If the regular random formulas are 2-regular, then 

i r =0i r+ i=0 V' r / W z 

((^ + (' + l)^l)') 2 coef / / { 3)CT f{xix 3) ^- r , r - (r+ l), r+ljc n r+(r+1 ), r+ A 

(kan)\ \ ! 

(15) 



For both the strictly regular and the 2-regular case, the expression for E (N NAE {n,a) 2 ) is 
the same as that for E (N(n,a) 2 ) except replacing the generating function f{x\,X2,x 3 ) 
by f mE { x i, x 2, x 3), which is given by 



f N AE{xux 2 ,x 3 ) = (1 +x\ + x 2 +x 3 ) k - 



((1 +xi) k + (1 +x 3 ) k - 1 + fa +x 2 ) k -x\ + (x 2 +x 3 ?-4-4) ■ (16) 



Proof. Let Ixy be the indicator variable which evaluates to 1 if the truth assignments 
X and Y satisfy a randomly regular k-SAT formula. Then, 



The last simplification uses the fact that the number of formulas which are satisfied by 
both X and Y depends only on the number of variables on which X and Y agree. Thus, 
we fix X to be the all-zero vector. 

We now consider the strictly regular case. The probability that the all-zero truth 
assignment and the truth assignment Y both are solutions of a randomly chosen regular 
formula depends only on the overlap, i.e., the number of variables where the two truth 
assignments agree. Thus for a given overlap i, we can fix Y to be equal to zero in the 
first i variables and equal to 1 in the remaining variables. This gives, 



In order to evaluate the probability that both and Y are solutions for a given overlap 
i, we observe that there are four different types of edges connecting the literals and the 
clauses. There are r(n — i) type 1 edges which are connected to true literals w.r.t. the 
truth assignment and false w.r.t. to the Y truth assignment. The ri type 2 edges are 
connected to true literals w.r.t. both the truth assignments. There are r(n — i) type 3 
edges which are connected to false literals w.r.t. the truth assignment and true literals 
w.r.t. to the Y truth assignment. The ri type 4 edges are connected to false literals 
w.r.t. both the truth assignments. Let f(xi,X2,xs) be the generating function counting 
the number of possible edge connections to a clause, where the power of x, gives the 
number of edges of type i, ; G {1,2,3}. A clause is satisfied if it is connected to at least 
one type 2 edge. Otherwise, it is satisfied if it is connected to at least one type 1 and 
at least one type 3 edge. Then the generating function f(xi,X2,x 3 ) is given as in (13). 
Using this, we obtain 

P (0 and Y are solutions) = 



E(N(n,a) 2 ) 



£ £(1xy)=2" £ P (0 and Y are solutions) . 

x,Ye{0A}" fg{0,1}" 



E(N(n, a) 2 ) = £ 2" ( " J P (0 and Y are solutions) . 



(17) 




(18) 



(kan)\ 



where (kan) \ is the total number of formulas. Consider a given formula which is satis- 
fied by both truth assignments and Y. If we permute the positions of type 1 edges on 



the clause side, we obtain another formula having and Y as solutions. The argument 
holds true for the type i edges, i e {2,3,4}. This explains the term (r(n — ;'))! in (18) 
which corresponds to permuting the type 1 edges (it is squared because of the same 
contribution from type 3 edges). Similarly, (r/)! 2 corresponds to permuting type 2 and 
type 4 edges. Combining (17) and (18), we obtain the desired expression for the second 
moment of the number of solutions as given in (14). 

We now consider the two regular case. Note that in this case the equivalent equation 
corresponding to (17) is 

E(N(n,a) 2 )=f J "f 2" f" r V" r+1 ") P ((0,7) is a solution) , (19) 
; r =o; r+1 =o V'r/Vr+l/ 

where i r (resp. z r +i) is the variable corresponding to the overlap between truth assign- 
ments and Y among variables with degree r (resp. r + 1). Similarly, the equivalent of 
(18) is given by 

P(0 and Y are solutions) = ((r(n r — ; r ) + (r + l)(n r+ i — / r +i))!) 2 

x ((r/ r + (r + l)/ r+1 )!) 2 
(kan)l 

x coef (/(x!,X2,x 3 ) Q ", (xix 3 ) r( " r - ,r)+(r+1)( " r + 1 - ,r + l) X2 r+(r+1) ' r+1 ) . (20) 

Combining (19) and (20), and observing that rn r + (r + l)n r +i = ^ , we obtain (15). 
The derivation of E (N NAE (n,a) 2 ) is identical except the generating function for NAE- 
satisfiability of a clause is different. This can be easily derived by observing that a clause 
is not NAE-satisfied for the following edge connections. Consider the case when a 
clause is connected to only one type of edge, then it is not NAE-satisfied. Next consider 
the case when a clause is connected to two types of edges. Then the combinations of 
type 1 and type 4, type 3 and type 4, type 1 and type 2, or type 2 and type 3 do not NAE- 
satisfies a clause. This gives the generating function f NAE (x\ ,X2,x$) defined in (16). □ 

In order to evaluate the second moment, we now present the multidimensional sad- 
dle point method in the next lemma [5]. A detailed technical exposition of the multidi- 
mensional saddle point method can be found in Appendix D of [18]. 

Theorem 1. Let [:— (ii,i2,h), j '■= (71,72,73), andx — (xi,X2,X3) 
< lim z'i/n, < lim z'2/n, < lim i^/n. 

n— 7-°° n— «— p-oo 

Let further f(x) be as defined in (13) and t_= (fi,?2,f3) be a positive solution of the 
saddle point equations af (x) = |x; dM/(*^*2,*3)) | _ Then coef (/(x)"",^ 1 ) can 
be approximated as , 

f( t \an 

coef (/(xr = — U (1 + (1)), 

(?)V(27ran) 3 \B(t_)\ 

using the saddle point method for multivariate polynomials, where B(x) is a 3x3 matrix 
whose elements are given by Bij = x .^Hf^^lflL — Bj j andafi(x) is the i' h coordinate 



ofaf(x). Also, coef (/(*)"",*-) can be approximated in terms o/coef (f{x) an ,x L ). This 
approximation is called the local limit theorem of j around i. Explicitly, ifu : = ^= (j — 

I) and \\u\\ = 0((lnn)3), then 

coef(f(x) an ,xL) = fi-iexp -B(t_)- 1 ■ u T ^j coef (f{x) an ,xp) (1 +o(l)). 

Because of the relative simplicity of the expression for the second moment, we 
explain its computation in detail for the strictly regular case. Then we will show how 
the arguments can be easily extended to the 2-regular case. The derivation for the NAE- 
satisfiability is identical for both cases. 

Theorem 2. Consider the strictly regular random k-SAT model with literal degree r. 
Let S(i) denote the i th summation term in (14), and 7 = i/n. IfS(n/2) is the dominant 
term i e 

Um tMl)> lta ,!lfe!)), 7 e[<U], 7 4 

n— n n— >oo yi 2j 
then with positive probability a randomly chosen formula has a satisfying assignment, 
i.e. 

2 \ \Bf(x k ,x 2 k ,x k )\ 
limP(iV(a,tt)>0)> V , (22) 

n ^°° cj s b g (x k )Vk 

where x k is the solution of the saddle point equation a q (x) = | — 1 defined in Lemma 3, 
a q (x) and bq(x) are defined according to (5), Bf(xk,xi,xic) is defined as in Theorem 1, 
and the "normalized variance " u 1 of the summation term around S (|) is given by 

a * = 4+^(l-l,l-l]-B f (x k ,xlx K )-i-[-l,l-l]T)-8r ' ^ 

Let r* be the largest literal degree for which S(n/2) is the dominant term, i.e. (21) 
holds, then the threshold a* is lower bounded by a* > a\ = ^j- . 

Proof. From (14) and Theorem 1, the growth rate of S(-fn) is given by, 
,( 7 ) 4 lim ln( ^ (7w)) = 

n 

(1 - fo)(ln(2) + A( 7 )) + aln(/(fi,f 2 ,f 3 )) - r(l - 7 )(ln(n) + ln(r 3 )) - r 7 ln(f 2 ), 

(24) 

where 1 1,^2,^3 is a positive solution of the saddle point equations as defined in Theo- 
rem 1, 



a f (t_) 4 j. 



3ln(/(fi,f 2 ,f 3 )) 3ln(/(fi,f 2 ,f 3 )) d\n(f(t u t 2 ,h)) \ _ 
3*1 ' 2 dt 2 ' 3 3f 3 J 



k , k k , ■ 

2(1 -7), ^7, 2^ --»!>■ (25) 



In order to compute the maximum exponent of the summation terms, we compute its 
derivative and equate it to zero, 



dS<y1 ^ - (1 - to) In ( ) +rln(fi)-rln(f 2 )+rln(f 3 ) = 0. (26) 



dj V 7 

Note that the derivatives of t\ , t 2 and r 3 w.r.t. 7 vanish as they satisfy the saddle point 
equation. Every positive solution (t\ , t 2 , £3 ) of (25) satisfies t\ = £3 as (25) and f(t 1 , t 2 > h ) 
are symmetric in t\ and f 3. If 7 = 1/2 is a maximum, then the vanishing derivative 
in (26) and equality of t\ and £3 imply t 2 = t\. We substitute 7 = 1/2, t\ = h, and 
?2 = t\ in (25). This reduces (25) to the saddle point equation corresponding to the 
polynomial q(x) defined in Lemma 3 whose solution is denote by x k . Then by observing 
f{x k ,x\,x k ) = p(x k ) 2 , we have 

f.3/2 

S(n/2) = = e n(2]n(2)(l-ka)+2g]n(p(x k ))-kaHx k ))^ 

2 1 / 2 y /TTn^J\Bf(x k ,x 2 k ,x k )\ 

(27) 

Using the relation that q(x) = we note that the exponent of S(n/2) is twice the 
exponent of the first moment of the total number of solutions as given in (7). In order 
to compute the sum over S(-fn), we now use Laplace's method, a detailed discussion of 
which can be found in [7,8, 12]. We want to approximate the term S(n/2 + Ai) in terms 
of S(n/2). For the coef terms, we make use of the local limit theorem given in Theorem 
1 and for the factorial terms we make use of Stirling's approximation. This gives, 



,•2 



S(n/2 + Ai)=S(n/2)e ^J(l+o(l)), where Ai = 0{n x ^ln{n) x l y ). (28) 

Note that in the exponent on the R.H.S. of (28), the linear terms in Ai are absent as 
the derivative of the exponent vanishes at 7 = 1 /2. As the deviation around the term 
S{n/2) is 0(\/n) and the approximation is valid for Ai = 0(y/n\n(n) l l y ), the dominant 
contribution comes from -6>(V») < Ai < 0{^f{n)). We are now ready to obtain the 
estimate for the second moment. 

f281 - A ' 2 

E{N 2 (a,n))= £ S{n/2)e ^(l+o(l)), (29) 

Ai=— c^fn 

= S(n/2) / e ^dS(l+o(l)). (30) 

J S=— 00 

= S{n/2)^2im<j 2 {\+o{\)). (31) 

We can replace the sum by an integral by choosing sufficiently large c. Using the second 
moment method given in (12) and combining Lemma 3, (27), and (31), we obtain 



E(N(a n) 2 ) 2 \ \ B f( x k,xj ,x k )\ 
E(N(a,n) z ) (j s b q {x k )y/k 



Letting n go to infinity, we obtain (22). Clearly, if the supremum of the growth rate of 
S(jn) is not achieved at 7 = 1/2, then the lower bound given by the second moment 
method converges to zero. This gives the desired lower bound on the threshold. □ 

We can easily extend this result to the 2-regular case. In the following theorem we 
accomplish this task. Due to space limitation, we omit explanation of some steps which 
can be found in [17]. 

Theorem 3. Consider the 2-regular random k-SAT model where the number of vari- 
ables with degree r (resp. r + 1) is n T = A r n (resp. n T +\ = A r+ \n). Let 5(/ r ,; r +i) = 
S(7r«r,7r+l«r+l) be the summation term on the R.H.S. of (15) corresponding to over- 
lap i r (resp. i r +i) on the degree r(resp. r + 1) literals. Let g(7 r ,7r+i) be the growth rate 
o/5(7 r n r ,7 r+ i« r+ i) i.e. g(7 r ,7r+i) = lim„^oo in(s(»r7r,»r+i)) _ y 

>g(7r,7r+l),7r G [0,l],7r+l G [0,1], 7r ^ ^7r+l ^ ^, 

then with positive probability a randomly chosen formula has a solution. More pre- 
cisely, 



\Bf(x k ,xl,x k )\A r A r+l 

limP(Ma,n) > 0) > — ■ == . (33) 

b q {x k )VW\ 

The definition of x k , b q (x k ), and Bf(x kl x\ ) x k ) is same as in the Theorem 2. The 2x2 
matrix S is defined via, 

C / = [-l,l,-l].(B / (^,4,^))- 1 .[-l,l,-l] r , A = X +2r2 (S"^)' 

4 2(r + l) 2 /C/ 4\ 2r(r + l)/C/ 4\ , \A B~}~ ] 

fl = — + - L - J -|^-r).C=-^ f -t ),thenZ= BQ 



A T+ \ a \2 k 

(34) 

The threshold a* is lower bounded by a], where ol[ is defined by 
a] = sup ja:gQ,0 > g(7r,7r+i),7r G [0,l],7r+i G [0,1], 7 r 7^ i,7 r+ i ^ i|. 

Proof. Define r(7 r ,7 r+ i) = r/l r 7 r + (r + l)yl r+ i7 r+ i. Then by using Theorem 1 
and Stirling's approximation, we obtain 

g(7r,7r+i) = ln(2) +A r h(-/ r ) + A r+l h(~/ r+ i) 

I ka 

+ (A:a-2r( 7r ,7 r+1 ))ln — - r( 7r , 7r+ i) 



^ 2 

+ 2(r(7 r , 7r+1 ))ln(r(7 r , 7r+1 ))-toln(to) 



+ aln (/((»)))- (y-r( 7r ,7 r+ i))ln( W3 )-(r(7 r ,7 r+ i))ln(? 2 ), (35) 



where I = {/i,^^} is a positive solution of the saddle point point equations as given 
in Theorem 1 , 



Q (t) = [ k r (7r,7r+l) -T(7r,7r+l) k r^Tr+l) \ 

\ 2 a a ' 2 a J 



(36) 



corresponding to the coefficient term of power of f(xi,X2,X3). In order to obtain the 
maximum exponent, we take the partial derivatives of g(7 r ,7r+i) with respect to 7r 
and 7 r+ i and equate them to zero. This gives the following equations. 

Infl^A 2rlnf^-r( 7r , 7r+ i)V 2rln ( r (7r,7r + i)) + rlnf^ =0, 



2- \ in i±~r± / i i \- v i * •) /iTi// 1 i , 
/ V f 2 

ln(i^±i)-2(r + l)ln(^-r (7r , 7r+1 ) 

+ 2(r + l)ln(r( 7r , 7r+1 ) + (r + l)ln^^ =0. (37) 

Note that t\ = tj, = x^, = x\, 7r = j, and 7r+ i = | is a solution of (36), (37), which 
corresponds to S ^f 1 ), where x<- is the solution of the saddle point equation cor- 
responding to q(x) defined in Lemma 3. We recall that for the second moment method 
to work, the maximum exponent should be equal to twice the exponent of the aver- 
age number of solutions. Indeed by the proposed solution, the term S , ^r 1 ) has an 
exponent which is twice that of the average number of solutions. If this is also the max- 
imum, then we have the desired result. Assuming that S {^f, has the maximum 
exponent, we now compute the second moment of the total number of solutions. By 
using Stirling's approximation and the local limit result of Theorem 1, we obtain 

S(/ r + A r ,/ r+ i+A r+1 ) =e -^[^ r ,^ r+1 ].i:-'.[A r ,A r+1 f (3g) 

S (z' r ,z'r+l) 

where the matrix S is defined in (34). Using the same series of arguments as in Theorem 
2, we obtain 



E(N(a,n) 2 )= £ S ^±1) e ~ U^Ai, + ^~ l -[^Ai^^ m 

Ai r ,Ai r+l V 2 2 1 

=5 (y' ! r i ) /~/^^ [j:r ' JCr+ll -^ Mj:r ' j;r+lir ^x r+1 (i+ (i)), (40) 

, 3 

= Vl^l fcI e 2«(ln(2)-faln(2)+aln(pfa))-^lnfa))( 1+0 ( 1 ^ ^ 

A^\Bf(x k ,X 2 k ,X k )\A r A T+ ]_ 

By using the second moment method, we obtain the bound given in (22). Note that 
the second moment method fails if the term S {^f , ^ ti ) is n °t the dominant term. This 
gives the lower bound a] on a* . □ 

In the next section we discuss the obtained lower and upper bounds on the satisfia- 
bility threshold and NAE-satisfiability threshold. 



4 Bounds on Threshold 



In Table 1, lower bounds and upper bounds for the satisfiability threshold are given. 
The upper bound is computed by the first moment method. As expected, we obtain 
the same upper bound for regular random 3-SAT as given in [6]. The lower bound is 
derived by the second moment method for strictly regular random A:-SAT. In order to 
apply the second moment method, we have to verify that s(j), defined in (24), attains its 
maximum at 7 = j over the unit interval. This requires that 7 = \ is a positive solution 
of the system of equations consisting of (25) and (26) and it corresponds to a global 
maximum over 7 e [0, 1]. Also, of defined in (23) should be positive. The system of 
equations (25), (26) is equivalent to a system of polynomial equations. For small value 
of k, we can solve this system of polynomial equations and verify the desired conditions. 
In Table 1 this has been done for k = 3,4. The obtained lower bound for 3-SAT is 2.667 
which is an improvement over the algorithmic lower bound 2.46 given in [6]. For larger 
values of k, the degree of monomials in (26) grows exponentially in k. Thus, solving 
(25) and (26) becomes computationally difficult. However, s(j) can be easily computed 
as its computation requires solving only (25), where the maximum monomial degree 
is only k. Thus, the desired condition for maximum of 5(7) at 7 = 5 can be verified 
numerically in an efficient manner. 

Note that the difference between the lower bound obtained by applying the second 
moment method to the strictly regular case can differ by at most 2/k from the corre- 
sponding lower bound for the 2-regular case. We observe that as k increases the lower 
bound seems to converge to 2 k In (2) — (k + 1 ) — 1 , which is the lower bound for the 
uniform model. 

We observe similar behavior for the NAE-satisfiability bounds. As expected, we 
observe that the upper bound on the NAE-satisfiability threshold for the regular ran- 
dom model converges to 2 fc_1 ln(2) — ^p-. The lower bound obtained by applying the 
second moment method to the regular random model seems to converge to the value ob- 
tained for the uniform model. Thus, the NAE-threshold for the regular random model is 
2 k ~ x ln(2) — 0(1). This suggests that as k increases, the threshold of the regular model 
does not differ much from the uniform model. 



k 


r* 


a* 






r NAE 


a (,NAE 


a «,NAE 


3 


4 


2.667 


3.78222 


0.492216 


3 


2 


2.40942 


4 


16 


8 


9.10776 


0.357487 


8 


4 


5.19089 


7 


296 


84.571 


85.8791 


0.378822 


152 


43.4286 


44.0139 


10 


3524 


704.8 


705.9533 


0.170403 


1770 


354 


354.545 


15 


170298 


22706.4 


22707.5 


0.101635 


85167 


11355.6 


11356.2 


17 


772182 


90844.94 


90845.9 


0.007749 


386114 


45425.2 


45425.7 



Table 1: Bounds on the satisfiability threshold for strictly regular random k-SAT. a* t and 
r* are defined in Theorem 2. The upper bound a* u is obtained by the first moment method. 
a *lani = 2*In(2) — (k+ 1)^5^1 — 1 i s lower bound for uniform model obtained in [3]. The quan- 
tities r£j AE , a/ N ae> o^.NAE are analogously defined for the NAE-satisfiability. 



Our immediate future work is to derive explicit lower bounds for the regular ran- 
dom fc-SAT model for large values of k as was done for the uniform model in [1,3]. 
The challenge is that the function s(j) depends on the solution of the system of poly- 
nomial equations given in (25). Thus determining the maximum requires determining 
the behavior of the positive solution of this system of polynomial equations. Another 
interesting direction is the maximum satisfiability of regular random formulas. For the 
uniform model, the maximum satisfiability problem was addressed in [2] using the sec- 
ond moment method. In [16], authors have derived lower and upper bounds on the 
maximum satisfiability threshold of regular random formulas. 
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